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Abstract 

There are considerable improvements over the years in both Bioinformatics 
and Computer Sciences leading to multiple breakthroughs in the respective 
fields. As bioinformatics is about finding new ways to analyze the data (huge 
data) for logical conclusions. Artificial Intelligence can be used to analyze 
process and categorize the gigantic amount of biological data in less time. 
Numerous AI algorithms have been developed and used in bioinformatics 
analyses. Such algorithms are used to generate knowledge in an efficient way. 
The algorithms based on the applications of AI helps in molecular dynamic 
simulations, discovering vaccines for diseases, molecular docking analyses, 
identifications of novel compounds, ADMET properties, machine and deep 
learning of tools and in silico structure prediction. This review article 
summarizes the applications of AI that employed in bioinformatics; 
specifically, genetic algorithm in bioinformatics tools. 
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Introduction 


From last decade, there has been much progress in 
bioinformatics and Artificial Intelligence (AI), and 
more opportunities are available to understand the 
biological data and problems. Bioinformatics is an 
interdisciplinary field of science utilized to solve 
numerous biological problems by employing 
mathematical models, statistical approaches, and 
algorithms along with computational powers [1]. AI is 
the ability of a computational system to perform 
different tasks linked with intelligent beings and as a 
simulation of human intelligence processes by the 
computer systems [2]. AI is the ability of a machine to 
abstract, be creative and perform the tasks based on its 
training [3]. Bioinformatics approaches have been 
utilized to solve numerous biological problems [4]. 
The structural bioinformatics tools have the 
applications of AI and have effective methodologies 
to design active novel compounds against 
neurological disorders [5-11] and cancer [12-17] 
through in silico means by using the tools having AI 
applications. Bioinformatics approaches are used to 
analyse the biological data for logical conclusions. 
The projects of whole genome sequencing are 
generating an enormous amount of biological data 
related and bioinformatics approaches help to solve 
and annotate the data in meaningful ways. Numerous 
biological problems have been solved by employing 
the approaches of AI along with bioinformatics 
approaches and help to design effective algorithms for 
gene prediction, computational drug designing, 
protein-protein interactional studies, genome wide 
association studies, next generation sequences and 
software development. AI has wide range of 
application in bioinformatics [18, 19]. The efficiency 
of Al improves by modifying its code according to the 
input data. AI is classified into two types as 
generalized AI and Applied AI. The generalized AI is 
the development of AI in which the machines perform 
the tasks as humans do while applied AI is the 
development that can stimulate as human-like 
thoughts and expressions [20-22]. AI and heuristic 
methods showed significance for the present and 
future developments of bioinformatics and having the 
potential for a revolutionary impact on biotechnology, 
pharmacology and medicine [23, 24]. Now a day, 
applied AI helps in the prediction of personalized 
medicines by analyzing the genomic data of the 
patients along with the diagnoses of the patients. The 
algorithms base on AI has the ability to analyze the 
data efficiently as compared to the algorithms having 
no AI training. The algorithms of 
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AI including linear regression, logistic regression and 
K-Nearest neighbors’ algorithm in bioinformatics 
improve the efficiency of the computational biology 
to solve the biological problems [25, 26]. 


Application of Al in bioinformatics 


There is a gigantic amount of data in biological 
databases and the annotation of this data is still a 
question for the researchers. The applications of AI in 
bioinformatics have the ability to annotate the data 
towards logical conclusions [27]. The simulations of 
different models, annotations of biological sequences, 
computational drug designing, virtual screening and 
gene prediction can efficiently predict through the 
association of AI and bioinformatics [28]. The major 
contribution of AI in bioinformatics analyses depends 
on pattern matching and knowledge-based learning 
systems to solve the biological problems. 

The advancement in AI and bioinformatics has 
significant contribution in health sciences including 
immunoinformatics [29] and vaccinology and leads to 
the clinical bioinformatics, high-throughput 
screening, disease prevention and epidemiology. The 
development of new vaccines is becoming more 
difficult due to the increased rate of mutations in 
microbes and viruses. The advancement helps to 
improve the power and algorithms of computational 
simulation. Now a day, computational systems have 
the capacity to screen the vaccine targets from 
>20,000 flavivirus proteins to >100,000 influenza 
proteins. The generated data can be interpreted by 
using different methods results to logical conclusions. 
Researchers designed in silico methodologies to 
discover vaccines (Figure 1). 

The success of AI in bioinformatics has widely used 
algorithms and methodologies including neural 
networks, probabilistic approach, decision trees, 
cellular automata, hybrid methods and genetic 
algorithms to solve numerous biological problems 
[30, 31]. 


Genetic Algorithm 


A genetic algorithm is a _ heuristic-based search 
algorithm inspired by the natural evolution theory of 
Charles Darwin. The genetic algorithm follows the 
idea of natural selection to select the fittest survival 
and the significant output [32]. The genetic algorithm 
is designed to perform the tasks in five major phases 
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Fig. 1: Workflow of vaccine discovery process including bioinformatics tools and reasoning. 


(Figure 2) and select the fittest iteration from the 
generation [33]. 

Initially, the first population generates a set of 
individuals known as population and each individual 
have genes (defined by a set of parameters). The 
combinations of defined set of genes form a 
chromosome. The fitness function determines the 
efficiency of an individual and calculates a score for 
each individual. The calculated score determines the 
selection of an individual leads to the selection of the 
fittest individual based on the fitness score for the 
progeny. The parents are selected for the reproduction 
through this process. The crossover is considered as 
the significant step for each pair of parents for the 
random selection of the genes results to the generation 
of offspring. There is a probability that the strings of 
the genes can flip for the generation of new offspring 
at mutation phase. The algorithm terminates by the 
repeated formation of the same generations, 
considered as the final product [34]. 


Initial Population 


Fitness Check 


Crossover 


Fig. 2: Flowchart of the major followed phases of the 
Genetic Algorithm. 
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Fig. 3: The flowchart of the Genetic Algorithm. 


Genetic Algorithm is utilized to effectively enhance 
the multiple sequence alignment [35, 36]. This 
approach uses a population of alignments to generate 
a fitness score based on the matching and 
mismatching of the columns [37-39]. Half of the 
suitable alignments are copied to the next generation 
leads to the crossover points to choose a cut for the 
random selection point in the first alignment sequence 
and another cut is made for the second alignment to 
adjust the first sequence. One parent is spliced to add 
the gaps leading to the splicing of another parent to 
add the gaps to ensure the alignment consistency [40- 
42]. Genetic algorithm predicts efficient alignments 
results as compared to other alignment algorithms [43- 
45]. The applications of AI in bioinformatics play an 
important role to solve the biological problems 
computationally [46-50]. 


Conclusion 


In future, Bioinformatics will play a significant role to 
analyze the huge data sets by using AI leads to save 
the time and resources. It will also help to accelerate 
the biological discoveries, especially in medicine, 
biomedical fields and robotic surgery. 
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